Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420090010040127
Phonetics and Speech Sciences
2009 Volume.1 No. 4 p.127 ~ p.132
Effective Combination of Temporal Information and Linear Transformation of Feature Vector in Speaker Verification
Seo Chang-Woo

Zhao Mei-Hua
Lim Yong-Hwan
Jeon Sung-Chae
Abstract
The feature vectors which are used in conventional speaker recognition (SR) systems may have many correlations between their neighbors. To improve the performance of the SR, many researchers adopted linear transformation method like principal component analysis (PCA). In general, the linear transformation of the feature vectors is based on concatenated form of the static features and their dynamic features. However, the linear transformation which based on both the static features and their dynamic features is more complex than that based on the static features alone due to the high order of the features. To overcome these problems, we propose an efficient method that applies linear transformation and temporal information of the features to reduce complexity and improve the performance in speaker verification (SV). The proposed method first performs a linear transformation by PCA coefficients. The delta parameters for temporal information are then obtained from the transformed features. The proposed method only requires 1/4 in the size of the covariance matrix compared with adding the static and their dynamic features for PCA coefficients. Also, the delta parameters are extracted from the linearly transformed features after the reduction of dimension in the static features. Compared with the PCA and conventional methods in terms of equal error rate (EER) in SV, the proposed method shows better performance while requiring less storage space and complexity.
KEYWORD
speaker verification (SV), principal component analysis (PCA), delta cepstrum, Gaussian Mixture model (GMM)
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)